Discriminant Analysis for Unsupervised Feature Selection
Abstract
Feature selection has proven efficient in preparing high-dimensional data for data mining and machine learning. Because most data is unlabeled, unsupervised feature selection has attracted increasing attention in recent years. Discriminant analysis is a powerful technique for selecting discriminative features in the supervised setting, but it usually requires label information, which unlabeled data lacks. This gap makes it challenging to apply discriminant analysis to unsupervised feature selection. In this paper, we investigate how to exploit discriminant analysis in unsupervised scenarios to select discriminative features. We introduce the concept of pseudo labels, which enable discriminant analysis on unlabeled data; propose a novel unsupervised feature selection framework, DisUFS, which integrates the learning of discriminative features with the generation of pseudo labels; and develop an effective algorithm for DisUFS. Experimental results on different types of real-world data demonstrate the effectiveness of DisUFS.
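The idea of pseudo labels can be illustrated with a minimal sketch. It is not the DisUFS algorithm: the abstract describes learning features and pseudo labels together, whereas the code below simply runs two steps once in sequence. Plain k-means and a per-feature Fisher score are stand-ins chosen for illustration, and all function names are hypothetical.

```python
import numpy as np


def kmeans_pseudo_labels(X, k, n_iter=50, seed=0):
    """Assign each row of X a pseudo label with plain Lloyd's k-means (assumed stand-in)."""
    rng = np.random.default_rng(seed)
    # Start from k distinct samples picked at random.
    centers = X[rng.choice(len(X), size=k, replace=False)].astype(float)
    labels = np.zeros(len(X), dtype=int)
    for _ in range(n_iter):
        # Squared Euclidean distance from every sample to every center.
        dists = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
        labels = dists.argmin(axis=1)
        for j in range(k):
            members = X[labels == j]
            if len(members) > 0:  # keep the old center if a cluster empties
                centers[j] = members.mean(axis=0)
    return labels


def fisher_scores(X, labels):
    """Per-feature ratio of between-class to within-class scatter."""
    overall_mean = X.mean(axis=0)
    between = np.zeros(X.shape[1])
    within = np.zeros(X.shape[1])
    for c in np.unique(labels):
        Xc = X[labels == c]
        class_mean = Xc.mean(axis=0)
        between += len(Xc) * (class_mean - overall_mean) ** 2
        within += ((Xc - class_mean) ** 2).sum(axis=0)
    # Small constant guards against division by zero for constant features.
    return between / (within + 1e-12)


def select_features(X, k, n_selected):
    """Return indices of the n_selected features that best separate the pseudo labels."""
    labels = kmeans_pseudo_labels(X, k)
    scores = fisher_scores(X, labels)
    return np.argsort(scores)[::-1][:n_selected]
```

Calling `select_features(X, k, n_selected)` on a samples-by-features array returns the indices of the highest-scoring features. The number of clusters `k` is an assumption the caller must supply, and a poor clustering will yield poor pseudo labels, which is one motivation for coupling the two steps rather than running them separately.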
Similar resources
Spectral clustering and discriminant analysis for unsupervised feature selection
In this paper, we propose a novel method for unsupervised feature selection, which utilizes spectral clustering and discriminant analysis to learn the cluster labels of data. During the learning of cluster labels, feature selection is performed simultaneously. By imposing row sparsity on the transformation matrix, the proposed method optimizes for selecting the most discriminative features whic...
Structure Regularized Unsupervised Discriminant Feature Analysis
Feature selection is an important technique in machine learning research. An effective and robust feature selection method should simultaneously identify the informative features and eliminate the noisy ones. In this paper, we consider the unsupervised feature selection problem, which is particularly difficult as there are no class labels that would guide the search for releva...
A Nonlinear Grayscale Morphological and Unsupervised method for Human Facial Synthesis Based on an Example Image
Generating a human face from an example image is a requirement in biometric applications that identify individuals. In this paper, face generation consists of three main steps. In the first step, significant lines and edges of the example image are detected using nonlinear grayscale morphology. Then, hair areas are identified in the sample face. The fin...
Unsupervised Feature Selection Via Hypergraph Embedding
Most existing feature selection methods focus on ranking individual features based on a utility criterion, and select the optimal feature set in a greedy manner. However, the feature combinations found in this way do not give optimal classification performance, since they tend to neglect the correlations among features. In an attempt to overcome this problem, we develop a novel unsupervised fea...
Unsupervised Segmentation of Medical Images using DCT Coefficients
Image segmentation is a prerequisite for image content understanding and visual object recognition in medical images when developing a computer-aided diagnosis (CAD) system. An unsupervised segmentation method is proposed which uses discrete cosine transform (DCT) coefficients for extraction of feature vectors and the Fisher Discriminant K-means (FDK) technique for clustering image...